Some Recent Results on Ranking Webpages and Websites

نویسندگان

  • Ying Bao
  • Zhi-Ming Ma
  • Yan-Hong Shang
چکیده

In this paper we briefly review some of our recent results on the research of the design and analysis of search engine algorithms. The contents include: the limiting behavior of PageRank when the damping factor tends to 1; comparison of the convergence rate of maximal and minimal irreducible Markov chains on the Internet; a new proposal of N -step PageRank algorithm; a new proposal of ranking Websites– AggregateRank Algorithm. As is well known that in recent years Web search engines have been more and more important in modern science and technology, and more and more popular in civil daily life. The design of Web search engines has been becoming a focus of the research on the Web search and mining. One popular aspect is to calculate Static Rank by exploiting the hyperlink structure of the Web. Researchers have made great progress on link analysis models and algorithms since 1998, such as HITS and PageRank ([9, 17]). In nowadays, PageRank has emerged a popular link ∗1Academy of Mathematics and Systems Science, Chinese Academy of Sciences, Beijing 100080, China; 2Graduate University of the Chinese Academy of Sciences; 3School of Sciences, Beijing Jiaotong University, Beijing 100044, China (Email: [email protected], [email protected], [email protected]).

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Ranking WebPages Using Web Structure Mining Concepts

With the rapid growth of the Web, users get easily lost in the rich hyper structure on the web. Providing relevant information to the users to supply to their needs is the primary goal of the owners of these websites. Web mining is one of the techniques that could help the websites owner in this direction. Web mining was categorized into three categories such as web content mining, web usage mi...

متن کامل

A readability assessment of online Parkinson's disease information.

BACKGROUND Patients increasingly use the internet to access health information. Inadequate health literacy is common and frequently limits patient comprehension of healthcare literature. We aimed to assess the readability of online consumer-orientated Parkinson's disease (PD) information using two validated measures. METHOD We identified the 100 highest ranked consumer-orientated PD webpages ...

متن کامل

Accurate and Efficient Crawling for Relevant Websites

Focused web crawlers have recently emerged as an alternative to the well-established web search engines. While the well-known focused crawlers retrieve relevant webpages, there are various applications which target whole websites instead of single webpages. For example, companies are represented by websites, not by individual webpages. To answer queries targeted at websites, web directories are...

متن کامل

Phishing Detection with Popular Search Engines: Simple and Effective

We propose a new phishing detection heuristic based on the search results returned from popular web search engines such as Google, Bing and Yahoo. The full URL of a website a user intends to access is used as the search string, and the number of results returned and ranking of the website are used for classification. Most of the time, legitimate websites get back large number of results and are...

متن کامل

Webometrics-based Analysis and Ranking of Iranian Hospital Websites

Background and Objectives: Active presence of hospitals on the Internet is becoming a hallmark of hospitals’ commitment to quality healthcare services delivery. For insightful planning towards a strong Internet-based information delivery and communication, there is a need for continuous monitoring of hospital website’s status. Built on this need, this paper provides, for the first time, a ranki...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2007